Rank Aggregation in Scientific Publication Databases Based on Logistic Regression

نویسندگان

  • Martin Vesely
  • Martin Rajman
چکیده

The goal of the d-Rank project is to study rank aggregation in scientific publication databases. In our work we focus in particular on document ranking in the domain of particle physics and we work with the collection of CERN publications called the CERN Document Server. In this report we present the main advances achieved within the second phase of the project. The most important achievements notably include a creation of an extended CDS referential as an IR evaluation resource, implementation of two d-Rank software modules for query parsing and document ranking and creation of a rank aggregation framework based on logistic regression.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

D-Rank: A Framework for Score Aggregation in Specialized Search

In this paper we present an approach to score aggregation for specialized search systems. In our work we focus on document ranking in scientific publication databases. We work with the collection of scientific publications of the CERN Document Server. This paper reports on work in progress and describes rank aggregation framework with score normalization. We present results that we obtained wit...

متن کامل

تأخیر در انتشار مجله‌های علمی: مطالعه نشریات مصوب وزارت علوم، تحقیقات و فناوری ایران

: Publication delay is a negative phenomenon in scientific information dissemination. The current research studies the publication delay of scientific journals accredited by the Ministry of Science, Research & Technology of Iran. It also investigates the association between journals’ characteristics and their publication lag. This study employs the applied research method. All 1156 journals of ...

متن کامل

Supervised Kemeny Rank Aggregation for Influence Prediction in Networks

Identifying influential individuals in a network is commonly addressed through various socio-metrics like PageRank, Hub and Authority scores [1], etc. These metrics are primarily based on the actor's location in the network [2] and often captures only a subset of the critical factors that are usually at play while predicting influence in networks like, relationship of the network (type of edge)...

متن کامل

Merging Strategy Based on Logistic Regression

With the development of network technology, the users looking for information may send a request to various selected databases and then inspect multiple result lists. To overcome such multiple inspections, the database merging strategy involves the merging of the retrieval results produced by separate, autonomous servers into an effective, single ranked list. To achieve this merging, this study...

متن کامل

Analyzing ‘visual world’ eyetracking data using multilevel logistic regression

A new framework is offered that uses multilevel logistic regression (MLR) to analyze data from ‘visual world’ eyetracking experiments used in psycholinguistic research. The MLR framework overcomes some of the problems using conventional analyses, making it possible to incorporate time as a continuous variable and gaze location as a categorical dependent variable. The multilevel approach minimiz...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009